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METHOD AND APPARATUS FOR ACTIVE 
REDUCTION OF SPEAKERPHONE SINGING 

5 Field of the Invention: 

This invention relates to acoustic feedback in a communications 
device and specifically to speakerphone station sets and particularly to 
reduction of singing caused by feedback of a speaker output to the 
station set microphone. It also relates, in general, to any system in 

10 which audio output of a speaker may feed back into a microphone of 
the system causing singing (positive feedback) to occur. It specifically 
concerns a method and apparatus for determining the level of acoustic 
energy due to the output of a speaker appearing at a microphone of the 
communication device and to identifying such feedback energy as 

15 differentiated from that of the spoken input to the microphone. 
Background of the Invention: 

The amount of acoustic energy output of a speaker being fed 
back into a microphone of a duplex acoustic system with gain (i.e., a 
device used for communication purposes) determines the system 

20 acoustic stability. Such stability is important to prevent the generation 
of "singing" in which feedback of the speaker output onto the 
microphone causes reinforcement of sound from the loudspeaker and 
thus causes the speaker to emit a howl or similar high-pitched noise. 

There are existing methods of preventing this singing effect that 

25 operate by inserting switched loss into either the speaker or microphone 
path to ensure system stability. The amount of switched loss to insert is 
determined by comparing the microphone signal level to the speaker 
signal level from the network via a hybrid connected to the 
speakerphone. Examination of the relative levels of the two signals 
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permits a determination as to which signal level is presently active (i.e. 
speaker output or voice input). Loss is inserted in the path which is 
determined to be presently inactive ensuring that the total loop electro- 
acoustic gain of the speakerphone and the network is less than one at 
5 the frequency where at zero degrees loop phase shift is experienced. 
This criterion, known as the Nyquist stability criterion, determines how 
much loss must be present in the electro-acoustic loop consisting of the 
speakerphone and the network to sustain oscillations, in order to ensure 
stability. The overall loss inserted, in many arrangements, to maintain 

10 stability is related to the sum of signal-dependent switched loss and 
some fixed loss amount, which is needed to provide "sing" margin to 
compensate for inaccuracies in determination of the total amount of 
loop gain necessary to prevent oscillations at specific frequencies. 

The difficulty of these implementations has been in determining 

15 the amount of coupling which exists between the speakerphone 's 

speaker and its microphone (i.e., speaker output vs. voice input). The 
acoustic environment between speaker and microphone is often 
unstable making a determination between speaker feedback and voice 
input to the microphone difficult to assess. In another arrangement, it 

20 has been thought possible to have the relative signal levels determined 
at the hybrid connection of the speakerphone to the telephone network. 
It is theoretically possible to sample incoming and outgoing speech at 
the hybrid connecting the phone to the network to infer loop gain, but 
this method has difficulties due to the isolation loss of the hybrid and is 

25 often unsatisfactory 

Summary of the Invention: 

In an exemplary embodiment of the invention, identification of 
signals (i.e., voice input or speaker output) in a process for reducing 
acoustic feedback, in a communication device, is accomplished by 
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adding a signature noise (i.e., an identification mark) to output signals 
radiated by the speaker to enable these signals to be separated from 
speech input to the microphone. Having identified the signal (i.e., 
speech output) likely to cause a "singing" phenomenon, appropriate 
5 insertion loss to reduce the feedback may be added to the appropriate 
speech path within the communication device, to reduce a probability 
of singing. 

In the exemplary embodiment of the invention, the signature 
noise, applied to the speech output, comprises a psuedo-noise signal 
10 consisting of a digitally generated sequence (i.e., a PN sequence). The 
envelope of the speech signal fed to the loudspeaker modulates this PN 
sequence. 

The "signature" (i.e., PN sequence) added to speech issuing 
from the loud speaker identifies it in contrast to voice speech input to 

15 the microphone allowing it to be used to assist in any loss-switching 
process. In creating the signature, the speech output of the loudspeaker 
is combined with a pseudo-noise signal waveform consisting of a 
digitally generated sequence. The envelope of the speech that is fed to 
the loudspeaker modulates the PN signal. As such, it represents a low- 

20 level, "background" pink noise signal whose amplitude is proportional 
to the envelope of the speech that issues from the loudspeaker. 

The speech input to the microphone is correlated with a version 
of the PN sequence, such that the correlated result is in direct 
proportion to the amount of speech sampled by the microphone issuing 

25 from the loudspeaker. Voice input to the microphone does not contain 
the PN sequence and its level may be separately ascertained. As part of 
the PN detection process the voice input speech is largely ignored so as 
to be independent from the PN correlation output. For wideband 
acoustic systems, the technique may be applied with pink noise 
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"bands", which utilize separate PN sequences. In such an embodiment, 
separate correlators may be used to adjust loss in various portions of 
the audio pass band to effect stability control, minimizing degradation 
of the entire program content due to feedback in only one portion of the 

5 pass band. 

A second PN sequence may also be used to characterize the 
acoustic coupling path between the speaker and microphone. This 
second PN sequence would be made orthogonal to the first PN 
sequence in order to avoid interference between the two, and would be 

10 sent at a constant level through the loudspeaker. This second PN 
sequence would then be received by the microphone and correlated 
against the transmitted sequence to determine the impulse response of 
the acoustic path. This impulse response is then used to control an 
acoustic echo canceller. The advantage of using a PN sequence in 

15 addition to human speech in an acoustic echo canceller is that the PN 
sequence is a broadband signal and, hence, more accurately probes the 
acoustic environment. 
Brief Description of the Drawing: 

The sole figure is a block schematic of a speakerphone suitable 

20 for practicing the principles of the invention. 
Detailed Description: 

A speakerphone 101, as illustrated in the drawing, includes 
processing modules enabling practice of a method of identifying 
speaker output signals in accord with the principles of the invention. 

25 The illustrative speakerphone is attached to the network through a 
hybrid 103. Hybrid circuits are well known in telephony and further 
discussion is not believed necessary. Input into and output from the 
speakerphone is by the loudspeaker 131 and microphone 133, 
respectively. Incoming signals from the telephone network are 
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transmitted from the hybrid 103 to an envelope detector 105 to convert 
the incoming signals to a slowly varying voltage level that follows the 
energy of the incoming signal. The output of the envelope detector 105 
is applied to a gain cell 113 (multiplier) and coupled to the loudspeaker 

5 13 1 via an audio amplifier 123. Output signals are also routed to 

directly couple the hybrid 103 to another gain cell (multiplier) 115 that 
is also coupled to the amplifier 123 and loudspeaker 131. Those skilled 
in the art will observe that the analog processing functions described 
illustratively above could be performed by digital signal processing 

10 means. 

An exemplary embodiment contemplates a speakerphone 
wherein a loss-control processor is used to effect switched loss in either 
the transmit or receive path via the gain cells. This loss control 
processor is responsive to sensed envelopes of incoming (from the 

15 network) speech, as well as outgoing (to the network) speech, and the 
filtered correlator output. 

In accord with the invention, a PN sequence generator 1 1 1 is 
connected to apply the PN sequence to the gain cell 113 where it is 
modulated by the envelope of the incoming speech. In one exemplary 

20 embodiment, a minimum (non-zero) amount of PN noise is applied 
even when incoming speech is not present. This level ensures that the 
system will be able to prevent buildup of feedback in situations where 
neither incoming nor outgoing speech is present. Since the PN noise is 
low-level, it will not seriously degrade the quality of the incoming 

25 speech as it issues from the loudspeaker nor will it pose objectionable 
"standby" noise. 

Incoming voice-generated speech (i.e., human- generated input 
speech) is coupled from microphone 133 to amplifier 125 and coupled 
to gain cell (multiplier) 117. An envelope detector 121 is coupled to 
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detect the envelope of the amplified speech output of the microphone 
133. A gain cell (multiplier) 117 connects amplifier 125 to the hybrid 
circuit 103, effecting a means of inserting loss into the transmit path to 
the network. 

5 The output of amplifier 125 is also connected to a correlator 

circuit 119. Correlator circuit 1 19 is also connected to receive the PN 
sequence from PN sequence generator 111. The correlator 119 output 
identifies speaker output fed back to the microphone. The correlator 
output is filtered in filter 109 to band limit it to generate a signal 

10 reflective of the amount of feedback signal from the loudspeaker. This 
reflective signal is applied to a loss control processor 107. Both 
envelope detectors 121 and 105 have outputs applied to loss control 
processor 107 whose function is to determine the amount of switched 
loss to be applied to reduce the speakerphone-network loop gain to less 

15 than one according to the Nyquist stability criterion. 

The loss control processor 107 is responsive to envelope 
detector 105 and 121 representing input and output signals and to the 
filtered correlator output. Its (i.e., Correlator 119) function is to 
correlate the microphone signal output with a version of the PN 

20 sequence so that the output of the microphone, due to speech input, is 
differentiated from speech output. Hence, loss control is activated in 
direct proportion to the amount of speech sampled by the microphone 
that has issued from the loudspeaker. Voice input speech applied to the 
microphone will not contain the PN sequence. Accordingly, the speech 

25 input signal level may be accurately determined. In operation such as 
is contemplated in the exemplary embodiment, the human speech will 
be ignored by the correlator, because it contains no significant PN 
sequence content.. The information representing the differential speech 
characteristics is coupled to the loss control processor 107 that 



6 



Express Mail Label ER 240486263 US 



determines the loss to be applied to the speech circuits. The loss 
control processor 107 may be a stored program control processor 
programmed in software to perform the specified function. No detailed 
structural description is believed necessary, since stored program 

5 processors have a standard structure. 

Since the PN sequence spreads the frequency content of the 
noise over the audio pass band of the speaker, uneven frequency 
response of the loudspeaker or microphone, which partly determines 
the extent of loudspeaker-microphone coupling will pose less of an 

10 influence on the accuracy of the feedback determination than with 
conventional systems. It should be noted that movement of a 
speakerphone within an enclosed sound-field environment could cause 
violent changes in speaker/microphone coupling. The ability of the 
PN-pilot technique to detect these changes in real time can reduce the 

15 need to apply large "singing" safety margins that tend to make 
speakerphone conversations less "transparent". 

A feature of the described method is its ability to extend the 
process to characterize the acoustic channel for purposes of echo 
cancellation or equalizing the loudspeaker response to improve audio 

20 quality. In one arrangement a second PN sequence may be added for 
adjusting echo-canceling circuitry. 

It is readily apparent that the foregoing technique may reduce 
the disruptive effects of a condition of "double talk", where high 
speech levels are produced at the microphone from both incoming 

25 speech and feedback speech input. Such cases occur when the user of 
the speakerphone attempts to interrupt continuous speech arriving from 
the distant talker. In this instant, PN sequence levels are detected and 
appropriate supplemental loss may be is inserted into the loudspeaker 
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path to ensure that the speakerphone user will be heard by the distant 
party as an "interrupt". 

While the exemplary embodiment has been presented in terms 
of a speakerphone, it is to be understood that any communications 

5 device combining speech input and speech output, may advantageously 
utilize the described invention. For example, a small handheld device 
could very easily experience acoustic feedback between an output 
speaker (i.e., not a loud speaker here) and an input microphone due to 
close proximity of the two and because a handheld speakerphone is 

10 subject to movement in a near-field environment which may contain 
reflective objects capable of materially changing speaker-microphone 
coupling at various frequencies. Other embodiments, permitting 
practice of the invention, will be readily apparent to those skilled in the 
art, for example suppression of public address system feedback. 
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